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(1) GENERAL INFORMATION: 

(i) APPLICANT: NAG AT A, Shigekazu 
ITOH, Naoto 
YONEHARA/ Shin 

(ii) TITLE OF INVENTION : DNA CODING FOR HUMAN CELL SURFACE 
ANTIGEN 

(iii) NUMBER .OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: oto ^ u TTD 

(A) ADDRESSEE: BIRCH, STEWART, KOLASCH & BIRCH, LLP. 

(B) STREET: P.O. BOX. 74 7 

(C) CITY: FALLS CHURCH 

(D) STATE: VA 

(E) COUNTRY: USA 

(F) ZIP: 22040-0747 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(li) OOMPUTKR: T.I1M PC com pa t. .1 b I o 

2 (C) OPERATING SYSTEM: PC- DOS /MS- DOS 

P3 (D j SOFTWARE: Patentln Release #1.0, Version #1.30 
03 

•P (vi) CURRENT APPLICATION DATA: 

i£j (A) APPLICATION NUMBER: 08/468,560 

(B) FILING DATE: 06-JUN-1995 
\j (C) CLASSIFICATION: 

n (viii) ATTORNEY/AGENT INFORMATION: 

h (A) NAME: MURPHY JR., GERLAD M. 

0* (B) REGISTRATION NUMBER: 28,977 

FU (C) REFERENCE/DOCKET NUMBER: . 20-4393P 

P (ix) TELECOMMUNICATION INFORMATION: 

M , (A) TELEPHONE: 703-205-8000 

' ' (B) TELEFAX: 703-205-0050 



(2) INFORMATION FOR SEQ ID NO:l: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDRDNRSS: double 

(D) TOPOLOGY: lirtoar 

(ii) MOLECULE TYPE: cDNA to mRNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 195.. 1202 

(ix) FEATURE: 
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(A) NAME/ KEY : sig_peptide 

(B) LOCATION: 195.. 242 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 243.. 1199 

(ix) FEATURE: 

(A) NAME/KEY: polyA_site 

(B) LOCATION: 1831.. 1036 

(ix) FEATURE: 

(A) NAME /KEY : polyA_site 

(B) LOCATION: 2352.-2357 

(ix) FEATURE: 

(A) NAME /KEY : polyA_site 

(B) LOCATION: 2518. .2532 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GACGCTTCTG GGGAGTGAGG GAAGCGGTTT ACGAGTGACT TGGCTGGAGC CTCAGGGGCG 
GGCACTGGCA CGGAACACAC CCTGAGGCCA GCCCTGGCTG CCCAOGCGGA GCTGCCTCTT 
CTCCCGCGGG TTGGTGGACC CGCTCAGTAC GGAGTTGGGG AAGCTCTTTC ACTTCGGAGG 

5 ' ATTGCTCAAC AACC ATG CTG GGC ATC TGG ACC CTC CTA CCTCTG GTT CTT 
tJ ATTGCTCAAC AACC ^ ^ ^ ^ ^ ^ ^ prQ Leu Val L 

m -i6 -is ■■■-lo - 5 

7 ACG TCT GTT GCT AGA TTA TCG TCC AAA AGT GTT AAT GCC CAA GTG ACT 

O Thr Ser Val Ala Arg Leu Ser Ser Lys Ser Val Asn Ala Gin Val Thr 

.jjj • -1-5 

^ TAP ATC AAC TCC AAG GGA TTG GAA TTG AGG AAG ACT GTT ACT ACA GTT 326 

>* Asp ?Ie !Sn Ser JJ2 Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val 

0 15 20 25 

^ • GAG. ACT CAG AAC TTG GAA GGC CTG CAT CAT GAT GGC CAA TTC TGC CAT 
Glu Thr Gin Asn Leu Glu Gly Leu His His Asp Gly Gin Phe Cys His 
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anr rrr TCT CCT CCA GGT GAA AGG AAA GCT AGG GAC TGC ACA GTC AAT 
£s Pro lys So Pro Gly Glu Arg Lys Ala Arg Asp. Cys Thr Val Asn 



45 50 55 



60 
120* 
100 
230 

278 



374 



422 



470 



GGG GAT GAA CCA GAC TGC GTG CCC TGC CAA GAA GGG AAG GAG TAC ACA 
Gly Asp Glu Pro Asp Cys Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr 
65 70 

GAC AAA GCC CAT TTT TCT TCC AAA TGC AGA AGA TGT AGA TTG TGT GAT 
Asp Lys Ala His Phe Ser Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp 
80 8 5 90 

GAA GGA CAT GGC TTA GAA GTG GAA ATA AAC TGC ACC CGG ACC CAG AAT 
Glu Gly His Gly Leu Glu Val Glu lie Asn Cys Thr Arg Thr Gin Asn 
95 100 105 

ACC AAG TGC AGA TGT AAA CCA AAC TTT TTT TGT AAC TCT ACT GTA TGT 614 



510 



566 
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CAT GGA AAG AAA GAA GCG TAT GAC ACA TTG ATT AAA GAT CTC AAA AAA 1091 
Mis Gly Lya Lys Glu Ala Tyr Asp Thr Leu He Lya Asp Leu Lya Lys 
' . 270 275 2H0 

GCC AAT CTT TGT ACT CTT GCA GAG AAA ATT CAG ACT ATC ATC CTC AAG 1112 
Ala Asn Leu Cys Thr Leu Ala Glu Lys lie Gin Thr He He Leu Lys 
285 290 295 300 

GAC ATT ACT AGT GAC TCA GAA AAT TCA AAC TTC AGA AAT GAA ATC CAA 1190 
Asp He Thr Ser Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu He Gin 
305 310 315 

AGC TTG GTC TAG AGTGAAAAAC AACAAATTCA GTTCTGAGTA TATGCAATTA 1242 
Ser Leu Val 

GTGTTTGAAA AGATTCTTAA TAGCTGGCTG TAAATACTGC TTGGTTTTTT ACTGGGTACA 1302 

TTTTATCATT TATTAGCGCT GAAGAGCCAA CATATTTGTA GATTTTTAAT ATCTCATGAT 1362 

TCTGCCTCCA AGGATGTTTA AAATCTAGTT GGGAAAACAA ACTTCATCAA GAGTAAATGC 1422 



Thr Lya Cyu Arrj Cyu Ly:i L'ro A:iti I'lio I'lio Cy:i A:m Hor Thr Vti.1. Cy.'i • 
HO 115 120 

GAA CAC TGT GAC CCT TGC ACC AAA TGT GAA CAT GGA ATC ATC AAG GAA 662 
Glu His Cys Asp Pro Cys Thr Lys Cys Glu His Gly He He Lys Glu 
125 130 135 140 

TGC ACA CTC ACC AGC AAC ACC AAG TGC AAA GAG GAA GGA TCC AGA TCT 710 
Cys Thr Leu Thr Ser Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser 
145 150 155 

AAC TTG GGG TGG CTT TGT CTT CTT CTT TTG CCA ATT CCA CTA ATT GTT 758 
Asn Leu Gly Trp Leu Cys Leu Leu Leu Leu Pro He Pro Leu He Val 
160 165 170 

TGG G'l'G AAG AGA AAG GAA G'l'A GAG AAA AGA TGG AGA AAG GAG AGA AAG flOfi 
Trp Val Lya Arg Lya Glu Val G.l.n Lya Thr Cys Arcj Ly:j Ilia Arg Lya 
175 180 185 

GAA AAC CAA GGT TCT CAT GAA TCT CCA ACC TTA AAT CCT GAA ACA GTG 854 
Glu Asn Gin Gly Ser His Glu Ser Pro Thr Leu Asn Pro Glu Thr Val 
190 195 200 



O GCA ATA AAT TTA TCT GAT GTT GAC TTG AGT AAA TAT ATC ACC ACT ATT 

& Ala lie Asn Leu Ser Asp Val Asp Leu Ser Lys Tyr He Thr Thr He 

m 205 210 215 220 
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£ GCT GGA GTC ATG ACA CTA AGT CAA GTT AAA GGC TTT GTT CGA AAG AAT 950 

iD ' Ala Gly Val Met Thr Leu Ser Gin Val Lys Gly Phe Val Arg Lys Asn 
J 225 230 235 

GGT GTC AAT GAA GCC AAA ATA GAT GAG ATC AAG AAT GAC AAT GTC CAA 998 
Gly Val Asn Glu Ala Lys He Asp Glu lie. Lys Asn Asp Asn Val Gin 
240 245 250 

GAC ACA GCA GAA CAG AAA GTT CAA CTG CTT CGT AAT TGG CAT CAA CTT 104 6 

Asp Thr Ala Glu Gin Lys Val Gin Leu Leu Arg Asn Trp His Gin Leu 
255 260 2f.ri 
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AGTGGCATGC TAAGTACCCA AATAGGAGTG TATGCAGAGG ATGAAAGATT AAGATTATGC 
TCTGGCATCT AACATATGAT TCTGTAGTAT GAATGTAATC AGTGTATGTT AGTACAAATG 
TCTATCCACA GGCTAACCCC ACTCTATGAA. TCAATAGAAG AAGCTATGAC CTTTTGCTGA 
AATATCAGTT ACTGAACAOG CAGGCCACTT TCCCfCTAAA TTAOCTCTGA 'I'AATTCTAGA 
GATTTTACCA TATTTCTAAA CTTTGTTTAT AACTCTGAGA AGATCATATT TATGTAAAGT 
ATATGTATTT GAGTGCAGAA TTTAAATAAG GCTCTACCTC AAAGACCTTT GCACAGTTTA 
TTGGTGTCAT ATTATACAAT ATTTCAATTG TGAATTCACA TAGAAAACAT TAAATTATAA 
TGTTTGACTA TTATATATGT GTATGCATTT TACTGGCTCA AAACTACCTA CTTCTTTCTC 
AGGCATCAAA AGCATTTTGA GCAGGAGAGT ATTAGTAGAG CTTTGCCACC TCTCCATTTT 
TGCCTTGGTG CTCATCTTAA TGGCCTAATG CACCCCCAAA CATGGAAATA TCACCAAAAA 
ATACTTAATA GTCCACCAAA AGGCAAGACT GCCCTTAGAA ATTCTAGCCT GGTTTGGAGA 
TACTAACTGC TCTCAGAGAA AGTAGCTTTG TGACATGTCA TGAACCCATG TTTGCAATCA 
AAGATGATAA AATAGATTCT TATTTTTCCC CCACCCCCGA AAATGTTCAA TAATGTCCCA 
TGTAAAACCT GCTACAAATG GCAGCTTATA CATAGCAATG GTAAAATCAT CATCTGGATT 
•TAGGAATTGC TCTTGTCATA CCCTCAAGTT TCTAAGATTT AAGATTCTCC TTACTACTAT 
CCTACGTTTA AATATCTTTG AAAGTTTGTA TTAAATGTGA ATTTTAAGAA ATAATATTTA 
TATTTCTGTA AATGTAAACT GTGAAGATAG TTATAAACTG AAGCAGATAC CTGGAACCAC 
. CTAAAGAACT TCCATTTATG GAGGATTTTT TTGCCCCTTG TGTTTGGAAT TATAAAATAT 
AGGTAAAAGT ACGTAATTAA ATAATGTTTT TG 

(2) INFORMATION FOR SEQ ID NO: 2: 

: ' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Gly He Trp Thr Leu Leu Pro Leu Val Leu Thr Ser Val Ala 

-10 "° 



-16 -15 



Arg Leu Ser Ser Lys Ser Val Asn Ala Gin Val Thr Asp lie Asn Ser 

1 5 10 

Lys Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val Glu Thr Gin Asn 

20 25 
Leu Glu Gly Leu His His As P Gly Gin Phe Cys His Lys Pro Cys Pro 

r>c 40 H3 



1482 

1542 

1602 

1G63 

1722 

17,82 

1842 

1902 

1962 

2022 

2082 

2142 ; 

2202 

2262 

2322 

2382 

2442 

2502 

2534 
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Pro Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro 
50 55 60 

Asp Cys Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr Asp Lys Ala His 
65 ''0 75 00 

Phe Ser Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly His Gly 
85 90 

Leu Glu Val Glu lie Asn Cys Thr Arg Thr Gin Asn Thr Lys Cys Arg 
100 105 HO 

Cys Lys Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp 
115 120 125 

Pro Cys Thr Lys Cys Glu His Gly He lie Lys Glu Cys Thr Leu Thr 
130 135 I 40 

Ser Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Leu Gly Trp 

145 150 155 

Leu Cys Leu Leu Leu Leu Pro lie Pro Leu He Val Trp Val Lys Arg 
165 170 I 75 

Lys Glu Val Gin Lys Thr Cys Arg Lys His Arg Lys Glu Asn Gin Gly 
180 185 19° 

Ser His Glu Ser Pro Thr Leu Asn Pro Glu Thr Val Ala He Asn Leu 
195 200 205 

Ser Asp Val Asp Leu Ser Lys Tyr He Thr Thr He Ala Gly Val Met 
210 215 220 

Thr Leu Ser Gin Val Lys Gly Phe Val Arg Lys Asn Gly Val Asn Glu 
225 230 ' 235 240 

Ala Lys He Asp Glu He Lys Asn Asp Asn Val Gin Asp Thr Ala Glu 
245 250 255 

Gin Lys Val Gin Leu Leu Arg Asn Trp His Gin Leu His Gly Lys Lys 
.. 260 265 270 

Glu Ala Tyr Asp Thr Leu He Lys Asp Leu Lys Lys Ala Asn Leu Cys 
275 280 285 • 

Thr Leu Ala Glu Lys He Gin Thr He He Leu Lys Asp He Thr Ser 
290 295 300 

Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu He Gin Ser Leu Val 
305 310 ■. 315 



(2) INFORMATION FOR SBQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11.) MOI.KCUI.K TYLM: - ,: I u 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 : 

Gin Asn Leu Glu Gly Leu His His Asp Gly Gin Phe Cys His Lys Pro 
15 10 15 

Cys Pro Pro Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp 
20 25 30 

Glu Pro Asp Cys Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr Asp Lys 

35 40 . . 45 

Ala His Phe Ser Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly 
50 55 60 

His Gly Leu Glu Val Glu lie Asn Cys Thr Arg Thr Gin Asn Thr Lys 
65 70 75 80 

Cys Arg Cys Lys Pro Asn Phe Phe Cys. Asn Ser Thr Val Cys Glu His 
G 85 . 90 95 

I Cys Asp Pro Cys Thr Lys Cys Glu His. Gly He He Lys Glu Cys Thr 

g 100 105 HO 

f ~ 

' Leu Thr Ser Asn Thr Lys Cys 

*B 115 

w 

SI (2) INFORMATION FOR SEQ ID NO: 4: 

B 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
H 1 (D) TOPOLOGY: linear 



! : 



(il) MOI.,l ;, .CUM!'. TYPE: protoi.n 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 4 : 

Val Cys Pro Gin Gly Lys Tyr lie His Pro Gin Asn Asn Ser lie Cys 
1 5 10 15 

Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly 
20 25 30 

Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr 
35 40 45 

Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys Cys Arg 
50 55 60 

Lys Glu Met Gly Gin Val Glu lie Ser Ser Cys Thr Val Asp Arg Asp 
65 70 . 75 80 
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Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu 
85 90 95 

Asn Leu Phe Gin Cys Phe Asn Cys Ser Leu Cys Leu Asn Gly Thr Val 
100 105 110 

His Leu Ser Cys Gin Glu Lys Gin Asn Thr Val Cys Thr Cys His Ala 
115 120 125 

Gly Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys Ser Asn Cys Lys 
130 135 140 

Lys Ser Leu Glu Cys Thr Lys Leu Cys 
145 .150 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
.( I)) TOPOLOGY: .U.nonr 

(ii) MOLECULE TYPE: protein 



=P (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

m Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin Thr Ala Gin Met Cys Cys 

5 1 5 10 15 

jU Ser Lys Cys Ser Pro Gly Gin His. Ala Lys Val Phe Cys Thr Lys Thr 

W 20 25 30 

m 

fU Ser Asp Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu 

H» 35 40 45 

M> Trp Asn Trp Val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Asp 

,50 55 60 

Asp Gin Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg He Cys 
65 70 .75 80 

Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys 
85 90 95 

Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala 
100 105 110 

Arg Pro Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro 
115 120 125 

Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr Asp He Cys Arg Pro His 
130 135 140 

Gin He Cys Asn Val Val Ala He Pro. Gly Asn Ala Ser Met Asp Ala 
145 150 155 160 

Val Cys Thr 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) GKOUKNCK DK.'.iCIU I.T I ON : SKO I'D NO: 6: 

Ala Cys Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala 

1 5 10 15 

Cys Asn Leu Gly Glu Gly Val Ala Gin Pro Cys Gly Ala Asn Gin Thr 

20 25 30 

P Val Cys Glu Pro Cys Leu Asp Ser Val Thr Ser Ser Asp Val Val Ser 



35 40 45 



& Ala Thr Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gly Leu Gin Ser 

Hi 50 55 60 

h Met Ser Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cys Arg Cys Ala 

m 65 70 75 80 

M 

a Tyr Gly Tyr Tyr Gin Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg 

n 85 90 95 



Val Cys Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gin Asp Lys Gin 
100 105 HO 

Asn Thr Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Ser Asp Glu Ala 
115 120 125 

: 'Asn His Val Asp Pro Cys Leu Pro Cys Thr Val Cys Glu Asp Thr Glu 
130 135 140 

Arg Gin Leu Arg Glu Cys Thr Arg ■ Trp Ala Asp Ala Glu Cys Glu 
• 145 . 150 155 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
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Ala Cys Arg Glu Lys Gin Tyr Leu He Asn Ser Gin Cys Cys Ser Leu 

1.5. 10 15 

Cys Gin Pro Gly Gin Lys Leu Val Ser Asp Cys Thr Glu Phe Thr Glu 
20 25 30 

Thr Glu Cys Leu Pro Cys Gly Glu Ser Glu Phe Leu Asp Thr Trp Asn 
35 10 45 

Arg Glu Thr His Cys His Gin His Lys Tyr Cys Asp Pro Asn Leu Gly 
50 55 60 

Leu Arg Val Gin Gin Lys Gly Thr Ser Glu Thr Asp Thr He Cys Thr 
65 70 .75 80 

Cys Glu Glu Gly Trp His Cys Thr Ser Glu Ala Cys Glu Ser Cys Val 
85 90 95 

Leu His Arg Ser Cys Ser Pro Gly Phe Gly Val Lys Gin He Ala Thr 
100 105 HO 

Gly Val Ser Asp Thr He Cys Glu Pro Cys Pro Val Gly Phe Phe Ser 
115 120 . 125 

Asn Val Ser Ser Ala Phe Glu Lys Cys His Pro Thr Ser Cys Glu Thr. 
130 135 1^0 

Lys Asp Leu Val Val Gin Gin Ala Gly Thr Asn Lys Thr Asp Val Val 
145 150 155 160 

Cys Gly 



(2) INFORMATION FOR SEQ ID NO: 8: 

H> (i) SEQUENCE CHARACTERISTICS: 

Q (A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asn Cys Val Lys Asp Thr Tyr Pro Ser Gly His Lys Cys Cys Arg Glu 
1 5 10 15 

Cys Gin Pro Gly Wis Gly Mot: Val Sor Arg Cyn Aap 1 1. In Thr Arg Asp 
20 25 30 

Thr Val Cys His Asn Cys Val Lys Asp Thr Tyr Pro Ser Gly His Lys 

35 40 45 

Cys Cys Arg Glu Cys Gin Pro Gly His Gly Met Val Ser Arg Cys Asp 
50 55 60 
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at 



His Thr Arg Asp Thr Val Cys His Cys Arg Pro Gly Thr Gin Pro Arg 
65 70 

Gin Asp Ser Ser His Lys Phe Gly Val Asp Cys Val Pro Cys P.ro Pro 

85 90 95 

Gly His Phe Ser Pro Gly Ser Asn Gin Ala Cys Lys Pro Trp Thr Asn 

100 i° 5 110 

Cys Thr Leu Ser Gly Lys Gin lie Arg His Pro Ala Ser Asn Ser Leu 



115 120 

Anp Thr V.1.1 Cy:i (.'..In 
130 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Lys Ala Pro His Pro Lys Gin Glu Pro Gin Glu lie Asn Phe Pro Asp 
1 5 10 

Asp Leu Pro Gly Ser Asn Thr Ala Ala Pro Val Gin Glu Thr Leu His 

20 * u 

Gly Cy ? i Gin I'ro V....I. Thr Gin Glu Am,) Gly l.y.'. Glu M»r 



(2) INFORMATION FOR SEQ ID. NO: 10 : 

: (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys Ala Pro His Pro Lys Gin Glu Pro Gin Glu He Asn Phe Pro Asp 
1 5 10 • 

Asp Leu Pro Gly Ser Asn Thr Ala Ala Pro Val Gin Glu Thr Leu His 
20 25 3° 

Gly Cys Gin Pro Val Thr Gin Glu Asp Gly Lys Glu Ser 
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35 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Lvs Ala Pro His Pro Lys Gin Glu Pro Gin Glu lie Asn Phe Pro Asp 
1 5 10 15 

Asp Leu Pro G.l.y Mor: A«n Thr A.l.,i AUi Pro V.U. Gin c;iu Thr Lou III.m 
?A) M) 

«% Gly Cys Gin Pro Val Thr Gin Glu Asp Gly Lys Glu Ser 

W 35 40 45 . 

§ 

£ v 
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